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a t; F v TH pnir> ^RHPP FT -T.TKF. FACTORS AND METHODS OF USE 

Rar.kf rround o f the Invention 
This invention relates to gene therapy methods for treating 

5 hemoglobinopathies. 

The major form of adult human hemoglobin, HbA, consists of a tetramer 
of two a-globin chains and two p-gidbin chains (<x 2 p 2 ). Hemoglobinopathies, 
such as sickle cell anemia and P'-thalassemia, are caused by a failure to 
produce normal levels of P-globin. Hemoglobin A, (HbA 2 ), which consists of a 
1 0 tetramer of two a-globin chains and two 6-globin chains (cc 2 6 2 ), is produced in 
low amounts in most sickle cell patients and in normal adults (2-3% of total 
hemoglobin) (Steinberg et al., Blood 78:2165, 1991). HbA 2 is a potent 
inhibitor of the sickle hemoglobin (HbS; cc 2 P s 2 ) polymerization characteristic of 
sickle cell anemia (Nagel et al., Proc. Natl. Acad. Sci. USA 76:670, 1979), and 
1 5 shares some functional activity with HbA. 

Svmmarv nf fhe lnvention 
We have shown that modification of the 6-globin gene promoter to 
include a binding site for the erythroid kriippel-like factor (EKLF) polypeptide 
that binds to the P-globin gene promoter (hereinafter "p-EKLF") results in 
20 increased expression from the 6-globin promoter. A modified P-EKLF 

(hereinafter "6-EKLF") that binds to the wild type 6-globin gene promoter can 
thus be used to induce 6-globin expression. 

Accordingly, in one aspect, the invention features a method of inducing 
6-globin gene expression in a cell, such as an erythrocyte precursor cell (e.g., 
25 an erythrocyte burst-forming cell (BFC-E) or an erythrocyte colony-forming 
cell (CFC-E)) or an erythrocyte. In this method, a nucleic acid encoding a 6- 
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EKLF polypeptide is introduced into the cell, or a precursor of the cell. Cells 
into which the nucleic acid can be introduced include erythrocyte precursors, 
such as BFC-E and CFC-E. Preferably, the nucleic acid is introduced into a 
pluripotent hematopoietic stem cell, which is capable of self renewal, and thus, 
5 in the context of gene therapy methods (see below), minimizes the number of 

treatments required. 

The invention also includes the use of a nucleic acid encoding a 6-EKLF 
polypeptide in the preparation of a medicament for inducing 6-globin gene 
expression in a cell. 

10 The cell into which the nucleic acid is introduced can be in a mammal or 

can be a cell that has been removed from a mammal for introduction of the 
nucleic acid, after which the cell, or progeny thereof, are introduced into a 
mammal. Typically, this method is carried out for a human patient. In one 
particular example, the nucleic acid can be introduced into a hematopoietic 
15 stem cell that has been obtained from a patient. Induction of 6-globin gene 
expression by this method can be used in gene therapy methods for the 
treatment of hemoglobinopathies, such as sickle cell anemia and p- 
thalassemia. 

The invention also features 6-EKLF polypeptides, which are identical to 
20 P-EKLF, except that they contain one or more modifications that permit them 
to bind to double stranded DNA containing the sequence 5'-TGA AAC CCT-3' 
or the sequence 5'-CTA ATG AAA-3'. The modifications that generate 6- 
EKLF polypeptides are generally in a DNA-binding amino acid of a zinc finger 
of the P-EKLF polypeptide. For example, modifications can be made in any of 
25 amino acids -1 and/or 2-6 in any of the three P-EKLF zinc fingers (see below 
and Fig. 2). 

Preferably, the 6-EKLF polypeptide of the invention is in a substantially 
pure preparation. By "substantially pure" is meant a preparation that is at least 
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60% by weigh. «ky weight) a 5-EKLF polypeptide. Preferably the preparation 
is at least 75%, more preferably at least 90%, and most preferably at least 99%, 
by weight a 8-EKLF polypeptide. Purity can be measured by any appropriate 
method, e.g., column chromatography, polyacrylamide gel electrophoresis, or 

5 HPLC analysis. 

Also featured in the invention is a nucleic acid, such as a nucleic ac.d 
containing deoxyribonucleotides (DNA), ribonucleotides (UNA), or 
combinations or modifications thereof, encoding 6-EKLF polypeptides. 
Preferably, the nucleic acid is in the form of purified DNA. By purified 
0 DNA" is meant DNA that is not immediate^ contiguous with both of the 
coding sequences with which it is immediately contiguous (one on the 5' end 
and one on the 3' end) in the natural.y occurring genome of the orgamsm from 
which i, is derived. The term thus includes, for example, a recombinant DNA 
m olecu.e that is incorporated into a vector, into an autonomously repl.cat.ng 
,5 plasmid or vims, or into the genomic DNA of a prokaryote or eukaryote, or 
which exists as a separate mo.ecule (e.g., a cDNA or a genomic DNA fragment 
produced by PGR or restriction endonuclease treatment) independent of other 
sequences. It also inc.udes a recombinant DNA molecu.e that is pari of a 
hybrid gene encoding additional polypeptide sequence. 

The invention also includes cells, such as hematopoietic stem cells or 

ii nvr F or CFC-E that contain nucleic acids 
erythrocyte precursor cells, e.g., BFC-fc or l. e, in 

encoding 5-EKLF polypeptides. 

Vectors containing nucleic acids encoding 6-EKLF polypeptides are 
a,so included in the invention. The vectors of the invention include those that 
can be used in gene therapy methods for treating hemoglobinopathies, such as 
sickle cell anemia and P Mhalassemia. For example, adeno-associated v.ra. 
(AAV) vectors and retroviral vectors (e.g., moloney murine leukemra v,ra. 
vectors) can be used in the invention. Vectors that can be used for amphfymg 
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nucleic acids encoding 6-EKLF polypeptides in bacteria are also included in 
the invention. Preferably, the nucleic acids encoding 6-EKLF polypeptides are 
operably linked to a promoter, for example, the P-globin promoter, or a non- 
tissue specific promoter, such as the Cytomegalovirus promoter. By "operably 
5 linked" is meant that a gene and a regulatory sequence(s), such as a promoter, 
are connected in such a way as to permit gene expression when the appropriate 
molecules (e.g., transcriptional activator proteins or proteins which include 
transcriptional activation domains) are bound to the regulatory sequence(s). 
The invention also features methods for identifying 6-EKLF 
10 polypeptides. In these methods, a nucleic acid containing either the sequence 
5-TGA AAC CCT-3' or the sequence 5'-CTA ATG AAA-3' is contacted with a 
candidate polypeptide that has been modified such that it differs from wild type 
P-EKLF by at least one amino acid, for example, an amino acid in a P-EKLF 
zinc finger that binds DNA. 6-EKLF polypeptides are then identified by their 
1 5 ability to bind to this nucleotide sequence. 

Expression of therapeutic levels of gene products in gene therapy 
methods is often difficult to achieve. Transcription factors are generally 
reqmred in lower quantities than other types of therapeutic gene products. 
Thus, an advantage of the present invention is that it provides a gene therapy 
20 method involving expression of a transcription factor, 6-EKLF. 

Other features and advantages of the invention will be apparent from the 
following detailed description, the drawings, and the claims. 

T4p^ f Descrip tion 0 f thR drawings 
Fig. 1A is a schematic representation of the P-globin locus on human 

25 chromosome 11. 

Fig. IB is a schematic representation of a comparison of portions of the 
human 6- and p-globin promoters, and modifications of the 6-globin promoter 
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to contain P-EKLF binding sites. The p- and 6-globin promoters were aligned 
using the GAP program of the GCG/Wisconsin package, and the regions of 
interest are shown. The consensus P-EKLF binding site in the P-globin 
promoter (CCACACCCT) is located at nucleotide positions -85 to -93 from the 
transcription start site and this sequence is underlined. The wild type 6-globin 
promoter region has a partial EKLF binding site (ACCCT) at nucleotide 
positions -81 to -85. To create the modified 6 CAC -' promoter, site-directed 
mutagenesis was used to convert the partial P-EKLF binding site in the 6- 

globin promoter to a consensus p^cr^r uiiiuiiig »uo. n.o ~ K — 

contains the P-globin EKLF binding site at nucleotide positions -85 to -93 of 
the 6-globin promoter, which is at the same distance from the transcription start 
site as the p-globin promoter EKLF binding site. 

Fig. 2 is a schematic representation of the amino acid sequences of p- 
EKLF zinc fingers 1-3. Amino acids that can be randomized for selecting 6- 
1 5 globin promoter binding variants include those that are boxed. 

Fig. 3A is a schematic representation of 6-globin promoter/reporter 
plasmids. The 1 .5 kb Kpn\-Bgll\ LCR HS2 fragment was linked to the -265 to 
+48 wild type p-, 6-, and modified 6-globin promoters described in the above 
description of Fig. IB. These promoters were used to drive lucifcrase reporter 
20 gene expression in transiently transfected MEL cells. Luciferase activity was 
normalized to p-galactosidase expression from an internal control CMV-p- 

galactosidase plasmid. 

Fig. 3B is a graph showing the results of transient expression of HS2 6- 
promoter/Luciferase plasmids in mouse erythroleukemia (MEL) cells. HS2 
25 p/Luc expression was normalized to 1 00%. Inclusion of the consensus p- 
EKLF binding site in the 6 CAC -' and 6 CA ° 2 promoters increased HS2 6/Luc 
expression from 10% to 56% and 99% of HS2 p/Luc, respectively. 
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Fig. 4 A is a schematic representation of HS2 6-p constructs. The 
parental HS2 6-P construct contained a 1 .9 kb Kpn\-Pvu\\ HS2 fragment, a 3.0 
kb 6-globin gene fragment beginning at nucleotide -400 from the transcription 
start site, and a 2. 1 kb p-globin gene fragment beginning at base -815 from the 
5 transcription start site. The corresponding 6 CAf "' and 6 CAC " 2 modifications were 
inserted to create HS2 6 CAC, -P and HS2 6 CAC " 2 -p. These constructs were 
transfected into MEL cells with a pgk-neo selectable marker and G418 resistant 
populations were selected. The cells were induced to differentiate with DMSO, 
and RNA was extracted for analysis. 
1 0 Fig. 4B is a photograph of denaturing gel analysis of the expression of 

modified 6-globin genes in MEL cells stably transfected with HS2 6-p 
constructs. The relative levels of 6- and p-globin mRNAs were determined by 
the Single Nucleotide Erimer Extension (SNuPE) assay (Szabo et al., Genes 
Dev. 9:1857, 1995) and quantitated by phosphorimagc analysis. Inclusion of 
15 the consensus p-EKLF binding site in the 6 CAC -' and 6 CA(: - 2 globin genes 

increased 6-globin mRNA levels from 3.6% to 24.7% and 34.8% of total 6+p 
mRNA, respectively. 

Fig. 5A is a schematic representation of the reporter plasmid HS2 6 GAU - 
p. This construct contains a consensus GAL4 binding site (expanded), 
20 replacing 6-globin promoter nucleotides -80 to -99. 

Fig. 5B is a schematic representation of pClneo effector constructs 
expressing either the GAM DNA binding domain (amino acids 1-147) alone or 
GAL4 (l . 147) fused to the P-EKLF activation domain (amino acids 2-275). 
Fig. 5C is a photograph of denaturing gel analysis showing that 
25 GAL4/P-EKLF activates 6-globin gene expression in an HS2 6 GAL4 -P construct. 
HS2 6 GAL4 -p was stably transfected into MEL cells with either pCIneo-GAL4 (l . 
, 47) or pClneo-GAL4 (1 .,47/P-EKL F > and relative transcript levels were 
determined by the SNuPE assay. The level of human 6/6+ was 4.4% in cells 
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containing HS2 6-P, 13.3% in cells containing HS2 6 GAL *-P plus pCIneo- 
GAL4 (M47 /P-EKLF. These results demonstrate that a modified p-EKLF can 
significantly increase 6-globin gene expression. 

Fig. 6 is a schematic representation of a model of 6-EKLF activation of 

5 6-globin gene expression. The 6-globin gene is normally expressed at a low 
level in erythroid cells. One reason for low 6-globin expression is the defective 
CACCC box at -90 in the 6-globin gene promoter. A modified P-EKLF, 6- 
EKLF, which contains zinc fingers designed to bind specifically to the 
defective 6-globin CACCC box, can be used in gene therapy approaches to 

1 0 increase HbA, levels and inhibit erythrocyte sickling. 

Detailed Description 
As discussed above, the major form of adult human hemoglobin, HbA, 
consists of a tetramer of two a-globin chains and two P-globin chains (a 2 p 2 ). 
Hemoglobinopathies, such as sickle cell anemia and pMhalassemia, are caused 

15 by a failure of erythrocytes and erythrocyte precursors to produce normal levels 
of P-globin. Also as discussed above, hemoglobin A, (HbA 2 ), which consists 
of a tetramer of two a-globin chains and two 6-globin chains (a 2 6 2 ), is 
produced in low amounts in most sickle cell patients and normal adults (2-3% 
of total hemoglobin). HbA 2 has potent anti-sickling activity and shares some 

20 functional activity with HbA. Thus, induction of increased levels of 6-globin, 
in order to facilitate generation of increased levels of HbA 2 , can be used to treat 
hemoglobinopathies, such as sickle cell anemia and P°-thalasscmia. 

The CACCC box at nucleotide positions -93 to -85 in the P-globin gene 
promoter is important for P-globin gene expression (Dierks et al., Cell 32:695, 

25 1983; Myers et al., Science 232:613, 1986; Cowie et al., Mol. Cell. Biol. 

8:3122, 1988). The major protein that binds to the CACCC box of the p-globin 
promoter is the erythroid-specific, zinc finger transcription factor EKLF (Miller 
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et al., Mol. Cell. Biol. 13:2776, 1993). P-EKLF contains three zinc fingers, 
which bind to the successive DNA base triplets, 5'-CCA CAC CCC-3' 
(nucleotides -93 to -85 with respect to the p-globin gene transcription start ^ 
site), of the p-globin CACCC box. In conjunction with proteins bound to the 
5 locus control region (LCR), EKLF binding to the P-globin CACCC box 

activates high levels of p-globin gene expression (Donze et al., J. Biol. Chem. 
270:1955, 1995; Nuez et al., Nature 375:316, 1995; Perkins et al., Nature 
375:318, 1995). 

The human 6- and P-globin genes are located at the 3' end of the p- 
10 globin locus on chromosome 1 1 (Fig. 1 A). Comparison of the sequences of the 
6- and p-globin proximal promoters (Fig. IB) reveals that the 6-globin 
promoter lacks consensus CACCC and CCAAT boxes, which are present in the 
P-globin promoter and are important for adult p-globin gene expression (Dierks 
et al., Cell 32:695, 1983; Myers et al., Science 232:613, 1986; Cowie et al., 
15 Mol. Cell. Biol. 8:3122, 1988). 

Sequence comparisons and analyses of crystal structures of other zinc 
finger-containing proteins bound to their DNA binding sites have revealed that 
amino acid positions in zinc finger a-helices that are involved in nucleotide- 
specific contacts are conserved. These conserved positions have been 
20 designated 

-1 , 2, 3, and 6, with respect to the beginning of the a-helix, and are shown for 
each p-EKLF zinc finger in Fig. 2. Positions 4 and 5 may also affect sequence- 
specific DNA recognition. 

The 6-globin gene promoter contains a partial p-EKLF binding site, 5'- 
25 TGA AAC CCT-3', at nucleotide positions -89 to -8 1 . p-EKLF does not bind 
to the 6-globin partial p-EKLF binding site. As is described further below, 
insertion of a p-EKLF binding site into the 6-globin promoter, at or near the 6- 
globin partial P-EKLF binding site, enhances expression from the 6-globin 
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promoter. Thus, modified P-EKLFs, which bind to 6-globin promoter 
sequences at or near the partial p-EKLF binding site and enhance transcription 
from the 6-globin promoter, can be used in methods to increase. 6-globin 
expression (Fig. 6). 

5 The invention provides modified P-EKLFs, designated 6-EKLFs, that 

contain altered zinc finger sequences that bind to, and enhance expression 
from, the 6-globin promoter. 6-EKLFs of the invention bind to, for example, 
nucleic acids containing the partial P-globin EKLF binding site (5'-TGA AAC 
CCT-3') of the 6-globin promoter. Also included in the invention are 6-EKLFs 

1 0 that bind to nucleic acids containing 6-globin nucleotides -93 to -85, which are 
at the same position in the 6-globin promoter, relative to the transcription start 
site, as the p-EKLF binding site in the P-globin promoter. Genes that encode 
6-EKLFs can be used in gene therapy methods for increasing HbA 2 production, 
which is useful in the treatment of hemoglobinopathies, such as sickle cell 

15 anemia and P°-thalassemia. 

PrnHtir.tjpp pf 6-F.KLFs 

6-EKLFs can be isolated, for example, using the phage display method 
(see, e.g., Rebar et ah, Science 263:671, 1994; Smith, Science 228:1315, 1985; 
Wu et al., Proc. Natl. Acad. Sci. USA 92:344, 1995; Jamieson et al., 

20 Biochemistry 33:5689, 1994; Choo et al., Nature 372:642, 1994; Choo et al., 
Proc. Natl. Acad. Sci. USA 91:11 163, 1994). In this method, a hybrid gene is 
made in which DNA encoding a peptide or protein domain of interest, such as a 
peptide containing a region of random sequences, is inserted at a site in the 
middle portion of the gene encoding the minor coat protein, pill, of 

25 bacteriophage fd. When expressed in the bacteriophage, the portion of the 

resulting fusion protein containing the peptide of interest is present at the tip of 
the phage capsid. Phage that thus display the peptide of interest can then be 
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used in selection methods for, e.g., identifying fusion proteins that contain 
regions that bind to specific DNA probes or antibodies. Selected phage can be 
amplified and sequence analysis of the selected random sequences can then be 
carried out. For use in isolating 6-EKLFs, the phage display method is carried 
out as follows. 

Amino acid positions -1 and 2-6 are randomized in each of the three p- 
EKLF zinc fingers (see Fig. 2; also see Miller et al., Mol. Cell. Biol. 13(5):2776 
for the nucleotide and amino acid sequences of P-EKLF), and variants that 
specifically bind nucleic acids containing 6-globin promoter sequences (e.g., 5 ! - 
TGA AAC CCT-3' or 5*-CTA ATG AAA-3') are selected. Because of 
limitations of the phage display method, preferably, one zinc finger is 
randomized and selected at a time. For example, beginning with the first P- 
EKLF zinc finger (see Fig. 2), overlapping oligonucleotides spanning the 
coding region of the zinc finger, and containing all four bases at the positions 
1 5 corresponding to amino acids -1 and 2-6, are annealed and extended with DNA 
polymerase (e.g, Sequenase 2.0, United States Biochemical) to create a library 
of DNA fragments that encode a mixture of all possible amino acid 
combinations at amino acid positions -1 and 2-6. These fragments are linked to 
the second and third zinc fingers of P-EKLF, and cloned into the vector RJSE5 
20 (Smith et al., Meth. Enz. 217:228, 1993). The resulting random library is then 
transformed into an appropriate E. coli strain, such as MC1061 (Smith et al., 
Methods Enzymol. 217:228, 1993), and a collection of bacteriophage are 
produced, which express a modified P-EKLF in their coat protein. In these 
modified p-EKLFs, the first zinc finger contains random sequences in amino 
25 acid positions - 1 and 2-6, and the second and third zinc fingers are wild type. 

Phage that display the modified p-EKLFs are then mixed with biotin- 
labeled double-stranded DNA fragments containing a modified P-EKLF 
binding site. Because the zinc fingers are being selected one at a time, the 
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modificd p-EKLF binding site has only one of the triplets modified. For 
example, in selecting the first zinc finger of 6-EKLF, the first triplet of the P- 
EKLF binding site is replaced with the corresponding 6-globin site. Thus, the 
DNA fragment contains 5'-CTA CAC CCT-3' instead of 5'-CCA CAC CCT-3', 

5 or 5'-TGA CTA GGG-3' instead of 5'-ACC CTA GGG-3'. Phage containing 
sequences that bind to the DNA probes can be detected by contacting the 
phage/DN A probe mixtures with streptavidin using, e.g., a biopanning method 
(see, e.g., Rebar et al., Science 263:671, 1994). 

Once the first zinc finger of a modified p-EKLF variant is selected 

10 which binds to the first 6-EKLF binding site triplet, then the above-described 
methods arc repeated to select second and third zinc fingers, which bind to the 
second and third 6-EKLF binding site triplets. When all three P-EKLF zinc 
fingers have been modified so that they each bind to their respective 5-EKLF 
binding site triplets, the DNA that encodes them is linked to the remainder of 

15 the p-EKLF protein coding sequence, to produce a nucleic acid encoding a 6- 
EKLF. 

In addition to phage display, other methods known in the art for 
selecting modified zinc fingers from libraries including random sequences can 
be used for isolating 6-EKLFs. For example, the challenge phage method can 
20 be used. Other standard methods are described, e.g., by Ausubel et al. {Current 
Protocols in Molecular Biology, John Wiley & Sons, New York, 1994). 

Expression of 6-FKLF in He matopoietic Stem Cells 

Expression of 6-EKLF in erythrocyte precursor cells (e.g., erythrocyte 
burst-forming cells (BFC-E) and erythrocyte colony-forming cells (CFC-E)) 
25 and erythrocytes can be used to increase 6-globin production, and thus HbA 2 
levels, in these cells and their progeny. In order to accomplish this, a gene 
encoding a 6-EKLF polypeptide can be introduced into hematopoietic stem 
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celis, or erythrocyte precursor cells (e.g., BFC-E and CFC-E), using gene 
therapy methods known in the art. Preferably, 6-EKLF genes are stably 
introduced into hematopoietic stem cells, which are capable of self renewal, in 
order to minimize the number of treatments required. Any vectors that can be 
5 used to accomplish stable integration, such as adeno-associated viral (AAV) 
vectors and retroviral vectors (e.g., moloney murine leukemia viral vectors), 
can be used. Use of an AAV vector for introduction of a gene, such as a gene 
encoding a 6-EKLF polypeptide, into hematopoietic stem cells can be carried 
out as follows. This method is also described by Luhovy ct al. (Biology of 
1 0 Blood and Marrow Transplantation 2:24-30, 1 996). 

AAV belongs to the genus Dependovirus (family parvoviridae), and is 
also known as adenosatellite and dependovirus. These viruses are antigenically 
unrelated to adenovirus, but require the presence of adenovirus in order to 
replicate. There are at least four serotypes of AAV, all of which share common 
1 5 antigens. Dependoviruses are characterized in that they ( 1 ) are small and 

contain single-stranded DNA, (2) have virions of 18 to 26 nm in diameter, (3) 
are not enveloped, (4) are ether-resistant, (5) have capsids with cubic symmetry 
(with 32 capsomeres), and (6) replicate and assemble into virions in the nucleus 
of infected cells. AAV vectors for use in gene therapy methods contain AAV 
20 inverted terminal repeats (ITRs). 

Appropriate promoters and genes, such as genes encoding 6-EKLF 
polypeptides, are introduced into AAV vectors using standard methods in the 
art (see, e.g., Sambrook et al., eds., Molecular Cloning: A Laboratory Manual, 
2nd Edition, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, 
25 1989). Any promoter that is sufficient to direct initiation of transcription in a 
hematopoietic cell, such as a hematopoietic stem cell or an erythrocyte 
precursor cell, can be used in the invention. For example, a preferred non- 
tissue specific promoter is the cytomegalovirus (CMV) promoter (DeBernardi 
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et al., Proc. Natl. Acad. Sci. USA 88:9257-9261, 1991, and references therein). 
Any appropriate hematopoietic cell-type specific promoter may also be used in 
the invention! For example, globin promoters, such as the P-globin promoter 
(Karlsson et al., Ann. Rev. Biochem. 54:1071-1 108, 1985, and references 
5 therein) (e.g., a p-globin promoter including a DNAse hypersensitivity site, 

such as HS2), may be used. 

The hematopoietic stem cells can be obtained from any appropriate 

source. Preferably, the cells are obtained from the patient into which they are 

to be transplanted after AAV-infection. Hematopoietic stem cells are obtained 
1 0 from patients and, after infection by AAV vectors, arc introduced back into 

patients using standard methods in the art (see, e.g., United States Patent 

5,061,620; Rubenstein et al., (eds.) Scientific American Medicine, Scientific 

American, Inc. (New York, 1978), and references therein). 

Production of AAV vectors containing genes encoding 5-EKLF 
1 5 polypeptides can be carried out using the semi-packaging cell line and vectors 

described below. 

Production of high titer, recombinant AAV 

A packaging cell line can be used to produce a high titer ( 1 0 7 - 1 0 8 
pfu/ml) stock of AAV-derived vectors containing genes encoding a 5-EKLF 

20 polypeptide (e.g., a 6-EKLF gene functionally linked to a CMV promoter). 
Such a "semi-packaging" cell line can be established with a silent episomal 
vector, such as pEBAVrc. This vector contains an EBV replication origin and 
an EBNA1 gene. This arrangement enables the vector to be established as an 
episome and to replicate to high copy numbers. The rep-cap genes of AAV are 

25 inserted into the vector with their natural control elements, so these genes are 
activated only in the presence of adenovirus early gene products. This 
arrangement provides a switch-system to regulate the expression of the rep 
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gene, which has been found to have a cytostatic effect. The vector also 
contains a hygromycin resistance gene to enable selection for cells that contain 
the vector. The plasmid sequences of the vector contain an ampicillin 
resistance gene and a bacterial replication origin for replicating the DNA in 
5 bacteria. 

An AAV vector containing a S-EKLF gene can be made in a similar 
vector, to generate pEBAVcmv/5-EKLF. This vector contains a CMV 
immediate-early transcriptional cassette containing a 8-EKLF gene, which is 
flanked by the two ITR sequences of AAV. 
1 0 To establish the semi-packaging cell line, HeLa cells in 1 00 mm culture 

dishes arc transfected with pEBAVrc. HeLa cells arc seeded 24 hours prior to 
transfection, and are allowed to reach 50-70% confluence at the time of 
transaction. Cells are washed once with serum-free DMEM (Gibco, 
Gaithersburg, MD). After removal of the medium, 8 ml of DMEM containing 
1 5 1 0% fetal calf serum is added to the culture on the next day. At 1 2 hours after 
this medium change, cells are treated with trypsin and passed to three 100 mm 
dishes in DMEM containing 10% fetal calf serum and 270 u/ml of Hygromcyin 
(Sigma). Special care should be taken to allow cells to settle at the bottom of 
the plate individually. Cells are selected with hygromycin for 10 to 1 5 days, or 
20 until individual colonies begin to grow. Colonies from 3 to 4 dishes are treated 
with trypsin and pooled into one 100 mm dish. Cells are kept at 60-70% 
confluence in medium containing 200 u/ml hygromycin. When 90% 
confluence is reached, cells are transferred into four 100 mm dishes in DMEM 
containing 10% DMEM without hygromycin. At 24 hours post-seeding, each 
25 1 00 mm dish of cells is transfected with 5 ug each of pEB A Vrc and 

pEBAVcmv/6-EKLF. At 3 hours after the addition of the transfection medium, 
wild-type adenovirus (HuAd5) is added to the cells at an MOI of 10, without 
removal of the transfection medium. At 12 hours after the transfection, the 
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medium is changed to DMEM containing 5% fetal calf serum and incubated for 
three days, or until most of the cells show cytopathic effects. All of the media, 
except 1 ml, is removed from each 100 mm dish. Cells are then scraped off the 
plate and transferred into a 10 ml screw-cap tube. This viral lysate is stored at - 
5 1 00°C, until further treatment. 

The viral lysate is thawed on ice, and sonicated for 10-15 minutes with 
pulse ultrasound. The tube is put back on ice periodically to prevent the sample 
from heating up. The viral lysate is then spun at 15,000 rpm for 10 minutes in 
a JL-20 centrifuge (Beckman). The supernatant is removed and aiiquoted into 
10 freezing wells of 10 ml each and stored at -100°C. For infecting hematopoietic 
stem cells, it is not necessary to remove or inactivate the adenovirus, since stem 
cells lack receptors for adenovirus. For infection of cells that are susceptible to 
adenovirus infection, adenovirus should be removed by chromatography or by 
CsCl gradient centrifugation. 
15 The titer of the virus can be determined by the rate of transduction of 

HeLa cells with the 6-EKLF gene. One, two, and three ml of the viral stock 
can be used to infect HeLa cells in 24-well plates. At 24 hours post-infection, 
the cells can be fixed with 0.1% glutaraldehyde at 4°C for 2 minutes, and 
washed thoroughly with DMEM. Cells can then be monitored for 6-EKLF 
20 expression, using standard methods. The transfection procedure is as follows. 
For each 100 mm dish, 8-10 ug DNA is dissolved in 2 ml of DMEM or 
transfection medium from BRL. 90-100 ul of lipofectamine (BRL) is 
suspended in 2 ml of DMEM. The lipid containing medium is mixed with the 
DNA containing medium slowly and dropwise, while the tube is gently rotated. 
25 After mixing well by gently rotating the tube, the transfection mixture is 

incubated for 30 minutes at room temperature. The mixture is then diluted with 
DMEM to a total volume of 8 ml. The mixture is then added to the cells, which 
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had been washed once or twice with DMEM without serum. Care should be 
taken to not allow the cells to dry between each wash. 

Purification of human hematopoietic stem cells 

Bone marrow is obtained from individuals following an IRB approved 
5 protocol. Mononuclear cells are purified using density gradient centrifugation 
using standard procedures. The mononuclear bone marrow cells are isolated by 
Ficoll/Hypaque density gradient centrifugation (average yield 1-2x1 0 8 cells). 
The use of magnetic activated celf sorting (M7^CS)"allowsW 
of CD34+ cells (Kato and Radbruch, Cytometry 14:384-392, 1993). The cells 
10 are labeled with anti-CD34+ antibodies conjugated with magnetic particles, and 
run through the magnetic separation column from Miltenyi Biotec (Sunnyvale, 
CA). Hematopoietic stem cells can be further enriched by selecting for and 
against various surface molecules according to principles outlined by Baum et 
al. (Proc. Natl. Acad. Sci. USA 89:2804-2808, 1992). The progenitors can be 
15 labeled with an FITC conjugated antibody that recognizes a different CD34+ 
epitope than the magnetic bead conjugated anti-CD34 antibody. Lin- cells are 
isolated by a negative selection using a cocktail of PE labeled antibodies that 
recognize myeloid, B cells, T cells, and NK cell markers (CD 2, CD 14, CD 19, 
CD 56). Thy+ cells are isolated by positive selection with biotinylated, anti- 
20 human Thy mouse monoclonal antibody that was labeled with streptavidin- 
conjugated, Texas red dye. 



Infection of hematopoietic stem cells 

Ten thousand CD34+, Lin-, and Thy+ cells are suspended in 200 ul of 
DMEM without serum and are dispensed into a single well of a 96 well plate. 
25 Ten microliters of the recombinant virus (AAV cmv/6-EKLF; 1 0" u/ml) arc 
then added to the cells and incubated at 37°C for 16 hours. The cells are then 



WO 97/47306 



PCTYUS97/10176 



-17- 

washed 3 times with DMEM without serum and can be plated into long term 
culture as described by Eaves et al. (J. Tiss. Cult. Meth. 13:55-62, 1991) for 
further^analysis. 

Analysis of cells plated into long term culture 

5 After 6 weeks in long term culture, cells arc removed and plated in 

methylcellulose for clonogenic progenitor assays (Eaves and Eaves, Blood 
52:1 196-1210, 1978). BFU-E are picked after 14 days, DNA is extracted, and 
PCR is performed with primers specific foTO-EKLF. PCR products are run on 
agarose gels, blotted to nitrocellulose, and hybridized with radiolabeled, 6- 

1 0 EKLF sequences. 

S-EKLF Polypeptide Repression 

6tEKLF polypeptides can be used to produce antibodies against 6- 
EKLF. Such antibodies can be used to monitor the levels of expression of 6- 
EKLF in patients treated by the gene therapy methods of the invention. In 
1 5 general, 6-EKLF polypeptides can be produced by transformation of a suitable 
host cell with all or part of a 6-EKLF-encoding nucleic acid fragment (e.g., a 
cDNA fragment) in a suitable expression vehicle. 

Those skilled in the field of molecular biology will understand that any 
of a wide variety of expression systems can be used to provide the recombinant 
20 protein. The precise host cell used is not critical to the invention. The 6-EKLF 
protein can be produced in a prokaryotic host (e.g., E. coif) or in a eukaryotic 
host (e.g., Saccharomyces cerevisiae, insect cells, e.g., Sf21 cells, or 
mammalian cells, e.g., COS-1, NIH 3T3, or HeLa cells). Such cells are 
available from a wide range of sources (e.g., the American Type Culture 
25 Collection, Rockland, MD; also, see, e.g., Ausubel et al., Current Protocols in 
Molecular Biology, John Wiley & Sons, New York, 1994). The method of 
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transformation or transfection and the choice of expression vehicle will depend 
on the host system selected. Transformation and transfection methods are 
described, e.g., in Ausubel et al. {supra); expression vehicles can be chosen 
from those provided, e.g., in Cloning Vectors: A Laboratory Manual (P.H. 
5 Pouwels et al., 1985, Supp. 1 987). 

One particular expression system is the baculovirus system (using, for 
example, the vector pBacPAK9) available from Clontech (Pal Alto, CA). 
Alternatively, a 6-EKLF polypeptide can be produced by a stably-transfected 
mammalian cell line. A number of vectors suitable for stable transfection of 
1 0 mammalian cells are available to the public, e.g., see Pouwels et al. (supra); 
methods for constructing such cell lines are also publicly available, e.g., in 
Ausubel et al. (supra). In one example, cDNA encoding the 6-EKLF 
polypeptide is cloned into an expression vector which includes the 
dihydrofolate reductase (DHFR) gene. Integration of the plasmid and, 
1 5 therefore, the 6-EKLF polypeptide-encoding gene into the host cell 

chromosome is selected for by including 0.01-300 fjM methotrexate in the cell 
culture medium (as described in Ausubel et al., supra). This type of dominant 
selection can be accomplished in most cell types. Recombinant protein 
expression can be increased by DHFR-mediated amplification of the 
20 transfected gene. Methods for selecting cell lines bearing gene amplifications 
are described in Ausubel et al. (supra); such methods generally involve 
extended culture in medium containing gradually increasing levels of 
methotrexate. DHFR-containing expression vectors commonly used for this 
purpose include pCVSEH-DHFR and P AdD26SV(A) (described in Ausubel et 
25 al., supra). Any of the host cells described above or, preferably, a DHFR- 
deficient CHO cell line (e.g., CHO DHFR" cells, ATCC Accession No. CRL 
9096) are among the host cells preferred for DHFR selection of a stably- 
transfected cell line or DHFR-mediated gene amplification. 
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Once the recombinant 6-EKLF polypeptide is expressed, it can be 
isolated, e.g., using affinity chromatography. In one example, an anti-6-EKLF 
polypeptide antibody (e.g., produced as described herein) can be attached to a 
column and used to isolate the 6-EKLF polypeptide. Lysis and fractionation of 
5 6-EKLF polypeptide-harboring cells prior to affinity chromatography can be 
performed by standard methods (see, e.g., Ausubel et al., supra). Once 
isolated, the recombinant protein can, if desired, be further purified, e.g., by 
high performance liquid chromatography (see, e.g., Fisher, Laboratory 
Techniques In Biochemistry And Molecular Biology, eds., Work and Burdon, 

10 Elsevier, 1980). 

Polypeptides of the invention, particularly short 6-EKLF polypeptide 
fragments (e.g., fragments containing zinc fingers), can also be produced by 
chemical synthesis (e.g., by the methods described in Solid Phase Peptide 
Synthesis, 2nd cd., 1984, The Pierce Chemical Co., Rockford, 1L). 

1 5 These general techniques of polypeptide expression and purification can 

also be used to produce and isolate useful 6-EKLF fragments (e.g., fragments 
containing one or more zinc fingers, or portions thereof) or analogs. 

Antibodies a painst 6-EKLF 

To generate 6-EKLF-specific antibodies, a 6-EKLF coding sequence 

20 (e.g., a sequence encoding a 6-EKLF zinc finger DNA binding domain) can be 
expressed as a C-terminal fusion with glutathione S-transferase (GST) (Smith 
et al., Gene 67:31-40, 1988). The fusion protein can be purified on 
glutathione-Sepharose beads, eluted with glutathione cleaved with thrombin (at 
a engineered cleavage site), and purified to the degree necessary for 

25 immunization of rabbits. 

Primary immunizations can be carried out with Freund's complete 
adjuvant and subsequent immunizations with Freund's incomplete adjuvant. 
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Antibody titers can be monitored by Western blot analysis and 
immunoprecipitation using the thrombin-cleaved 6-EKLF protein fragment of 
the GST-6 EKLF fusion protein. Immune sera can be affinity purified using a 
CNBr-Sepharose-coupled 6-EKLF polypeptide. Antiserum specificity is 
5 determined using a panel of unrelated GST proteins. 

As an alternate or adjunct immunogen to GST fusion proteins, peptides 
corresponding to relatively unique hydrophilic regions of 6-EKLF can be 
generated and coupled to keyhole limpet hemocyanin (KLH) through an 
introduced C-terminal lysine. Antiserum to each of these peptides can be 
10 similarly affinity purified on peptides conjugated to BSA, and specificity tested 
in ELISA and Western blots using peptide conjugates, and by Western blot and 
immunoprecipitation using 6-EKLF expressed as a GST fusion protein. 

Alternatively, monoclonal antibodies can be prepared using 6-EKLF 
polypeptides and standard hybridoma technology (sec, e.g., Kohler et al., 
15 Nature 256:495, 1975; Kohler et al., Eur. J. Immunol. 6:5 1 1 , 1 976; Kohler et 
al., Eur. J. Immunol. 6:292, 1976; Hammerling et al., In Monoclonal Antibodies 
and T Cell Hybridomas, Elsevier, NY, 1981; Ausubel et al., supra). Once 
produced, monoclonal antibodies can also be tested for specific 6-EKLF 
recognition by Western blot or immunoprecipitation analysis (e.g., by the 
20 methods described in Ausubel et al., supra). Antibodies that specifically 

recognize 6-EKLF are considered to be useful in the invention; such antibodies 
can be used, e.g., in an immunoassay to monitor the level of 6-EKLF produced 
by a mammal (e.g., a human treated using a gene therapy method of the 
invention). 

25 Preferably, antibodies of the invention are produced using fragments of 

the 6-EKLF polypeptide that appear likely to be antigenic, as determined by 
criteria such as high frequency of charged residues. In one specific example, 
such fragments are generated by standard techniques of PCR and cloned into 
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the pGEX expression vector (Ausubel et al., supra). Fusion proteins are 
expressed in E. coli and purified using a glutathione agarose affinity matrix as 
described in Ausubel et al. (supra). To attempt to minimize the potential 
problems of low affinity or specificity of antisera, two or three such fusions are 
5 generated for each protein, and each fusion is injected into at least two rabbits. 
Antisera are raised by injections in a series, preferably, including at least three 
booster injections. 

Insertion of a p-EKT,F Binding Site Activates Transcription of the '5-GT6Eih 
Promoter 

10 Experiments demonstrating that the presence of the p-EKLF binding site 

leads to activation of a downstream 5-globin promoter were carried out as 
follows. 

Plasmid constructions 

The construction of plasmid HS2 p/Luciferase has been described 

15 (Donze et al., J. Biol. Chem. 270:1955, 1995). The plasmid contains the 1.5 kb 
Kpnl-Bglll human Locus Control Region HS2 fragment upstream of the -265 to 
+48 human p-promoter driving the Luciferase reporter gene of plasmid pGL2- 
Basic (Promega Corp.). HS2 5/Luciferase was constructed by PGR 
amplification of the -265 to +48 region of the 6-promoter with Bglil ends 

20 attached to the primers. The template used was a human globin locus subclone 
containing the entire 6-gene, and the primer sequences (BglW sites underlined) 
were: upstream 

5'-CTCGAGGCTAGCAGAI£IGCAAAAATGAAACTAGA- 
3' (SEQ ID NO:l); downstream, 5'- 

CTCGAGGCTAGCAQAICICTGTTTGAGGTTGCTAGTGA-3' (SEQ ID 
25 NO:2). The PCR product was digested with BgUl and cloned into the plasmid 
HS2/Luc (Caterina et al., Nucleic Acids Res. 22:2383, 1994). Modified 6 CA °-' 
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and 6 CAC " 2 promoters were made by the megaprimer mutagenesis method 
(Sarkar et al., BioTechniques 8:404, 1990; Barik et al., BioTechniques 10:489, 
1991; Aiyar et al., BioTechniques 14:366, 1993), using the same outside 
oligonucleotides listed above with the following mutagenic oligonucleotides: 
5 6 CA<M , 5'- 

TTTTC ATTCTC AC AAACTA ACCACACCCIGCTTATCTT A A AC 

CA-3* (SEQ ID NO:3); 6 CA ° 2 , 5'- 
TCATTTTTCATTCTCACAAACCACACCC 

1CCCTGCTTATCTTAAACCAA-3' (SEQ ID NO:4). The consensus EKXF 
10 binding sites are underlined and in bold, and these fragments were cloned into 
HS2/Luc, as is described above. 

To construct the HS2 6-P gene plasmids, the 3.0 kb Xbal-Sphl fragment 
of the 6-globin gene (from -400 to about 1 kb downstream of the poly A site) 
was first subcloned into pUC19. The modified 6-promoters (amplified from 
15 the -400 Xbal site to the Sail site at - 1 3) were then synthesized by megaprimer 
mutagenesis using the same mutagenic oligos listed above and outside primers 
5'-AGTTTAAACTGCAGCAATAG-3' (SEQ ID NO:5) (starting at -460 of the 
6-promoter), and 5'-CTTCTCCTCAGGAGTCAG-3' (SEQ ID NO:6), (just 
downstream of the translation start site). These PCR fragments were cut with 
20 Xbal and Sail, and used to replace the corresponding wild type fragment in the 
6-gene pUC19 subclone. The & GAL4 -promoter was constructed in the same 
fashion, using the mutagenic oligonucleotide 5'- 
GAAGGTTCATTTTTCATTCTCCfifiAGGA 

CAGTCCTCCGGC TTATCTTAAACCAACCTGC-3' (SEQ ID NO:7), with 
25 the consensus GAL4 binding site (Giniger et al., Cell 40:767, 1985) underlined 
and in bold. Each of these 6-genes was cut out of the pUC19 vector with Xbal 
and Hindill, blunted with S 1 nuclease, and cloned into plasmid 5'HS2 (K-P) P" 
(Caterina et al., Nucleic Acid Res. 22:1006, 1994), which was cut with Gal 
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and blunted with SI nuclease. All PCR amplified regions in each of these 
constructs were completely verified by dideoxy sequencing using the 
Sequenase kit (United States Biochemical). 

The pCIneo-GAL4 (M47 /EKLF plasmid was constructed in two steps: A 

5 0.8 kb Ncol-Msll cDN A fragment containing the coding sequence (amino acids 
2-275) of the murine EKLF activation domain (from plasmid pSG5-EKLF 
(Miller et al., Mol. Cell. Biol. 13:2776, 1993)) was blunted with SI nuclease 
and cloned in frame into the GAM DNA binding domain expression plasmid 
pBXGl. The pBXGl/EKLF plasmid was cut with HindlU and BamHl, the 1.3 

10 kb GAL4 (M47 /EKLF fragment was isolated, blunted with SI nuclease, and 
cloned into pClneo (Promega Corp.) cut with Smal. As a control, the 0.5 kb 
GAL4 binding domain HindlU-BamUl fragment from pBXGl was cloned into 
pClneo, as is described above (pCIneo-GAL4 (M47) ). 

Mouse etythroleukemia (MEL) cell transfections 

15 MEL cells were maintained in DMEM containing 10% fetal bovine 

serum. For transient transfections, cells were washed once and resuspended to 
10 8 cells/ml in RPMI 1640 medium without serum. For transient luciferase 
assays, 40 pg of HS2 promoter/Luciferase plasmids were mixed with 5 ug 
internal control plasmid CMV-pgal (Clontech) in 100 ul phosphate buffered 

20 saline (PBS) and placed in a 0.4 cm gap electroporation cuvette (Bio-Rad). 

5x1 0 7 MEL cells in 0.5 ml RPMI 1640 medium (without serum) were added to 
each cuvette, and the cells were electroporated at 260 V and 960 uF in a Bio- 
Rad Gene Pulser. The contents of each cuvette were transferred into a 100 mm 
plate containing 10 ml DMEM/10% FBS, and incubated for 8 hours. After 8 

25 hours, DMSO was added to 1 .65% to induce erythroid differentiation. This 8 
hour pre-incubation, before DMSO induction, increased cell viability and gene 
expression levels several fold when compared to cells induced immediately 
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after electroporation. Cultures were incubated for three days and then cell 
extracts were assayed for luciferase and P-galactosidase as described (Caterina 
et al., Nucleic Acids Res. 22:2383, 1994; Donze et al., J. Biol. Chem. 
270:1955, 1995). Transient transfections were performed three times in 
5 triplicate. 

Transfection of MEL cells to produce G41 8 resistant populations was 
performed essentially as described by Collis et al. (EMBO J. 9:233, 1990). 
Twenty micrograms of HS2 6-P plasmids (with 6, 6 CAC '"\ or 6 CM%2 promoters) 
were mixed with a 1/10 molar ratio of plasmid pgk-neo (the murine 

1 0 phosphoglycerate kinase promoter driving neomycin phosphotransferase) . 

Both plasmids were linearized with Kpnl, precipitated, and dissolved in 100 uJ 
PBS. The plasmid mix was placed into a 0.4 cm electroporation cuvette, and 
2x1 0 7 MEL cells (prepared as is described above) in 0.5 ml were added. Cells 
were electroporated at 200 V and 960 uF, and transferred to 30 ml DMEM/10% 

1 5 FBS in a 75 cm 2 culture flask. After 48 hours, G41 8 was added to a final 

concentration of 300 ug/ml, and resistant cell populations were selected for two 
weeks. Cells were then induced with 1.65% DMSO for three days, and total 
RNA was extracted as described (Chomczynski et al., Anal. Biochem. 162: 156, 
1987). The HS2 6 GAL4 -P experiments were performed as above, using a 1:10 

20 molar ratio of the pClneo plasmids, which were linearized with Xmril. 

Analysis of relative levels ofb- and ^-transcripts 

The relative levels of 6- to P-globin transcripts were determined by the 
Single Nucleotide P_rimer Extension (SNuPE) assay, which is based on 
detecting single known nucleotide differences in allelic RNAs (Szabo et al., 
25 Genes Dev. 9: 1 857, 1995). Briefly, RNA from MEL cells transfected with 
HS2 6-P constructs was amplified by RT-PCR (cDNA cycle kit, Invitrogcn) 
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using oligo dT to prime the cDNA reaction. The PCR primers were: upstream, 
5-TGTTCACTAGCAA 

CCTCAAAC-3' (SEQ ID NO: 8); and downstream, 5'-TGAAGTTCTCAGGAT 
CCACGT-3' (SEQ ID NO:9). There are no differences in the 6- and p-globin 
5 sequences in these regions and the primers-do not amplify mouse globin 

mRNAs. These oligos specifically amplify corresponding 341 bp fragments of 
both the human 6- and p-globin cDNAs, and the fragments were purified by 
agarose gel electrophoresis. The detection oligo, 5'- 

CTTCTCCTC AGG A'GTC AG-3 ! (SEQTD NGT6) , hybri'dizes to identical sense 
1 0 strand sequences of both 6- and P-globin cDN A PCR products (corresponding 
to nucleotides +77 to +60 of each message), and is extended to base +59 by 
oc 32 P-dATP in the 6 reaction or by oc 32 P-dGTP for the P reaction. Labeled 
products were resolved on a 15% polyacrylamide/8M urea gel, and bands were 
quantitated on a Molecular Dynamics Model 425 Phosphorimager. Linearity of 
1 5 the assay was verified as described (Szabo et al., Genes Dev. 9: 1 857, 1 995) by 
mixing experiments using RNA from MEL cells transfected with either HS2 p 
or HS2 P-promoter/6-gene as sources containing the individual messages. 

Experimental Results 

To test the feasibility of using a modified EKLF to activate 6-globin 
20 gene expression, we determined whether a 6-globin promoter containing a P- 
globin EKLF binding site would increase expression by recruiting endogenous 
cellular P-EKLF in transient transfection assays. We replaced the defective P- 
EKLF binding site in the 6-globin promoter (TGAAACCCI from -89 to -81 , 
Fig. IB) with the consensus CCACACCCT, to construct the plasmid HS2 
25 6 CAC ''/Luciferase. We also constructed a modified 6-globin promoter that 
contained the CCACACCCT sequence at -93 to -85, so that the P-EKLF 



WO 97/47306 



PCTAJS97/I0176 



-26- 

binding site would be positioned the same distance from the transcription start 
site as it is in the p-globin promoter (HS2 6 CAC " 2 /Luciferase). 

Fig. 3B illustrates the relative expression levels of these modified 6- 
promoter/Luciferase reporter constructs (Fig. 3A) when linked to a Locus 

5 Control Region HS2 fragment and transiently transfected into mouse 

erythroleukemia (MEL) cells. In this assay, HS2 6/Luc was expressed at 10% 
of the level of HS2 p/Luc. When the consensus p-EKLF site was placed in the 
6-globin promoter at the -81 to -89 site (HS2 6 CAC -'/Luc), expression was 
increased approximately 5 fold, to 56% of HS2 p/Luc expression. HS2 6 CAC " 

1 0 2 /Luc, with the consensus P-EKLF site at -85 to -93, expressed the reporter 
gene at 99% of HS2 p/Luc. These results demonstrate that the major defect of 
the 6-globin gene proximal promoter is the lack of a consensus P-EKLF 
binding site. 

In addition to promoter defects, low 6-globin expression has been 
1 5 attributed to other differences between the 6- and p-globin genes. The 6-gene 
lacks intragenic enhancers, which augment p-globin expression (Kosche et al., 
Nucleic Acids Res. 13:7781, 1985; Behringcr et al., Proc. Natl. Acad. Sci. USA 
84:7056, 1987), and 6-globin mRNA is less stable than P-globin mRNA (Ross 
et al., J. Mol. Biol. 167:607, 1983). To assess the effects of these differences 
20 on 6 CAC "' and 6 CAC * 2 expression, we constructed complete 6-globin genes (see 
methods) containing the 6, 6 CAC1 , and 6 CA ° 2 promoters (Fig. 4 A). These 6- 
globin genes were used to produce HS2 6-P constructs. The plasmids were 
then co-transfected into MEL cells with a pgk-neo selectable marker and G418 
resistant populations were selected. After differentiation was induced with 
25 DMSO, RNA was extracted and the relative levels of 6- and P-globin mRN As 
were determined by the Single Nucleotide P_rimer Extension assay (SNuPE) 
(Szabo et al., Genes Dev. 9: 1 857, 1995). Table 1 shows the results of these 
experiments. The level of 6-globin mRNA (6/6+P) in MEL cells was 3.0 ± 
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1.3% for HS2 5-P, 14.9 ± 7.4% for HS2 6 CAC '^ and 22.0 ± 9.0% for HS2 
§cAc-2_p These results demonstrate that insertion of a consensus p-EKLF 
binding site in the 6-globin promoter significantly enhances 6-globin gene 
expression in a construct that contains the entire 6-globin gene. A 
5 representative example of the SNuPE assay is illustrated in Fig. 4B; 6/6+p- 
globin mRNA in this experiment was 3.6% for HS2 6-p, 24.7% for HS2 6 CACM - 
P, and 34.8% for HS2 6 CAC ' 2 -p. As a control, 6- and p-globin mRNA levels 
were determined for human reticulocyte RNA (Fig. 4B); the 6/6+P level was 
0.6 ± 0.2%. 

10 These results show that p-EKLF binding to the 6-globin gene promoter 

significantly stimulates expression; however, the experiments do not exclude 
the possibility that other CACCC binding factors are responsible for this 
increase. To confirm that the p-EKLF activation domain can enhance 6-globin 
gene expression, we constructed an HS2 6 OAL4 -P plasmid in which the 

1 5 consensus G AL4 binding site replaced sequences from -80 to -99 of the 6- 
globin gene promoter (Fig. 5 A). The coding sequence of the murine p-EKLF 
activation domain (amino acids 2-275) was fused in frame to the GAL4 DNA 
binding domain (amino acids 1-147), and the fusion fragment was subcloned 
into the pClneo expression vector (Fig. 5B). MEL cells were co-transfected 

20 with HS2 6 GAL4 -p plus pClneo-GAL4 (M47) or HS2 6 GAL4 -p plus pCIneo-GAL4 (1 . 
l47 /p-EKLF. Stable pools of G418 resistant cells were selected and, after 
DMSO induction, 6- and P-globin mRNA levels were determined by the 
SNuPE assay described above. 

The data in Table 2 demonstrate that the level of human 6/6+p mRNA is 

25 9.9 ± 2.5% in cells containing HS2 6 GAL4 -p plus pCIneo-GAL4 (M47) and 27.8 ± 
7.1°/ in cells containing HS2 6 GAU -P plus pCIneo-GAL4 (I . l47 /p-EKLF. A 
representative example is illustrated in Fig. 5C; the level of human 6/6+P 
mRNA is 4.4% in cells containing HS2 6-P plus pClneo-GAL4 {M47) , 13.3% in 
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cells containing HS2 6 aAL4 -p plus pCIneo-GAL4 (M47) , and 36.6% in cells 
containing HS2 6 CAL4 -p plus pCIneo-GAL4 (M47 /P-EKLF. These results 
demonstrate that a modified p-EKLF can significantly increase 6-globin gene 
expression. 

5 The 3-fold increase (4.4% to 13.3%) of 6-globin gene expression 

observed when 6 promoter sequences from -80 to -99 are replaced with a 
GAL4 binding site suggests that this region binds a factor that represses 
expression. A 3-fold increase is also observed when the -80 to -99 region of 
the 6-globin promoter is deleted in an HS2 6-p globin construct. 

1 0 Table 1 . Relative Levels of 6-globin Transcripts From MEL Transfections 
and Human Reticulocyte Samp les as % of Total Human 6 + B Transcripts. 

HS2 6-P HS2 6 CAC I -B HS2 6 CAC: :2 -P Reticulocyte 



Exp. 1 


3.6 


24.7 


34.8 


0.5 


15 Exp. 2 


1.9 


10.0 


15.0 


0.7 


Exp. 3 


1.6 


5.8 


12.3 


0.8 


Exp. 4 


4.7 


19.0 


25.7 


0.3 


Avg. 


3.0+/- 1.3 


14.9+/- 7.4 


22.0 +/- 9.0 


0.< 



20 — 

Expression of modified 6-globin genes in MEL cells stably transfected with 
HS2 6-p constructs. These constructs were transfected into MEL cells with a 
pgk-neo selectable marker, and G418 resistant populations were selected. The 
cells were induced to differentiate with DMSO, and RNA was extracted for 
25 analysis, the relative levels of 6- and p-globin mRNAs were determined by the 
Single Nucleotide Erimer Extension (SNuPE) assay and quantitated by 
phosphorimager analysis. 
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Table 2. Relative Levels of 6-gIobin Transcripts From MEL Cells 
Transfected with HS2-6 CAW -p and GAL4 (M47) Expression Constructs as % 
of Total 6 + p Transcripts 



5 HS2 8 GAU -p HS2 6 tiAL4 p 

+ GAL4 (M4 7) +GAL4 ( , H7 /P-EKLF 

Exp. 1 13.3 36.6 

Exp. 2 8.4 30.2 

Exp. 3 6.7 16.8 

10 Exp. 4 11.3 27.3 



Avg. 9.9+7-2.5 27.8+/- 7.1 



GAL4/p-EKLF activates 6-globin gene expression in an HS2 5GAL4-P 
15 construct. HS2 6GAL-P was stably transfected into MEL cells with either 
pCIneo-GAL4 (M47) or pCIneo-GAL4 (M47 /P-EKLF, and relative transcript 
levels were determined by the SNuPE assay. These results demonstrate that a 
modified P-EKLF can significantly increase 6-globin gene expression. 



All publications referred to herein are incorporated by reference in their 
20 entirety. Other embodiments are in the following claims. 
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<ii) TITLE OF THE INVENTION: DELTA -ERYTHRO ID KRUPPEL - LIKE 
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(iii) NUMBER OF SEQUENCES: 17 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Clark & Elbing LLP 

(BJ STREET: 176 Federal Street 

(C) CITY: Boston 

(D) STATE: MA 

(E) COUNTRY: USA 

(F) ZIP: 02110 

(v) COMPUTER READABLE FORM: 
(A) MEDIUM TYPE: Diskette 
<B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ for Windows Version 2.0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: PCT/US97/ 

(B) FILING DATE: 13-JUN-97 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 60/019,769 

(B) FILING DATE: 14-JUN-1996 



(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Elbing, Karen L. 

(B) REGISTRATION NUMBER: 35,236 

(C) REFERENCE/DOCKET NUMBER: 04005/014WO1 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 617-428-0200 

(B) TELEFAX: 617-428-7045 

(C) TELEX: 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 
CTCGAGGCTA GCAGATCTGC AAAAATGAAA CTAGA 3 5 

(2) INFORMATION FOR SEQ ID NO : 2 : 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 
CTCGAGGCTA GCAGATCTCT GTTTGAGGTT GCTAGTGA 3 8 

(2) INFORMATION FOR SEQ ID NO : 3 : 

{i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
< D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 
TTTTCATTCT CACAAACTAA CCACACCCTG CTTATCTTAA ACCA 44 
(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 
TCATTTTTCA TTCTCACAAA CCACACCCTC CCTGCTTATC TTAAACCAA 4 9 

(2) INFORMATION FOR SEQ ID NO:5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: CDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
AGTTTAAACT GCAGCAATAG 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
CTTCTCCTCA GGAGTCAG 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 
GAAGGTTCAT TTTTCATTCT CCGGAGGACA GTCCTCCGGC TTATCTTAAA CCAACCTGC 
(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
TGTTCACTAG CAACCTCAAA C 

(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
TGAAGTTCTC AGGATCCACG T 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
TGTGGAGCCA CACCCTAGGG TTGGCCAATC TACTCCC 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
TCACAAACTA ATGAAACCCT GCTTATCTTA AACCAAC 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
TCACAAACTA ACCACACCCT GCTTATCTTA AACCAAC 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
TCACAAACCA CACCCTCCCT GCTTATCTTA AACCAAC 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 amino acids 
<B) TYPE: amino acid 

<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Cys Gly His Glu Gly Cys Gly Lys Ser Tyr Ser Lys Ser Ser His Leu 

15 io 15 

Lys Ala His Leu Arg Thr His 
20 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Cys Ser Trp Asp Gly Cys Asp Trp Arg Phe Ala Arg Ser Asp Glu Leu 

15 10 15 

Thr Arg His Tyr Arg Lys His 
20 

(2) INFORMATION FOR SEQ ID NO:16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Cys Gly Leu Cys Gly Leu Cys Pro Arg Ala Phe Ser Arg Ser Asp His 

15 10 15 

Leu Ala Leu His Met Lys Arg His 
20 
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(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
CCGGAGGACA GTCCTCCGG 
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1 . Use of a nucleic acid encoding a 6-EKLF polypeptide in the preparation 
of a medicament for inducing 6-globin gene expression in a cell. 

2. A 6-EKLF polypeptide. 

3. A nucleic acid encoding a 6-EKLF polypeptide. 

5 4. A cell that contains a nucleic acid encoding a 6-EKLF polypeptide. 

5. The cell of claim 4, wherein said cell is a hematopoietic stem cell. 

6. The cell of claim 5, wherein said cell is an erythrocyte precursor cell. 

7. A vector containing a nucleic acid encoding a 6-EKLF polypeptide. 

8. The vector of claim 7, wherein said vector is an adeno-associated viral 
1 0 (AAV) vector or a retroviral vector, 

9. A method for identifying a 6-EKLF polypeptide, said method 
comprising contacting a nucleic acid containing the sequence 5-TGA A AC CCT-3' 
or the sequence 5'-CTA ATG AAA-3' with a candidate polypeptide that has been 
modified such that it differs from wild type p-EKLF by at least one amino acid and 

1 5 identifying said polypeptide by its ability to bind to said nucleic acid. 

10. The method of claim 9, wherein said amino acid is in a p-EKLF zinc 
finger that binds DNA. 
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